[INFO] cloning repository https://github.com/Nu11ified/picochat
[INFO] running `Command { std: "git" "-c" "credential.helper=" "-c" "credential.helper=/workspace/cargo-home/bin/git-credential-null" "clone" "--bare" "https://github.com/Nu11ified/picochat" "/workspace/cache/git-repos/https%3A%2F%2Fgithub.com%2FNu11ified%2Fpicochat", kill_on_drop: false }`
[INFO] [stderr] Cloning into bare repository '/workspace/cache/git-repos/https%3A%2F%2Fgithub.com%2FNu11ified%2Fpicochat'...
[INFO] running `Command { std: "git" "rev-parse" "HEAD", kill_on_drop: false }`
[INFO] [stdout] b1612dd8ce896eaf12bfc50b3df34abde51cf36e
[INFO] testing Nu11ified/picochat against master#f9988fefd3add01f414f52b414308e7872622fee for pr-155114
[INFO] running `Command { std: "git" "clone" "/workspace/cache/git-repos/https%3A%2F%2Fgithub.com%2FNu11ified%2Fpicochat" "/workspace/builds/worker-0-tc1/source", kill_on_drop: false }`
[INFO] [stderr] Cloning into '/workspace/builds/worker-0-tc1/source'...
[INFO] [stderr] done.
[INFO] started tweaking git repo https://github.com/Nu11ified/picochat
[INFO] finished tweaking git repo https://github.com/Nu11ified/picochat
[INFO] tweaked toml for git repo https://github.com/Nu11ified/picochat written to /workspace/builds/worker-0-tc1/source/Cargo.toml
[INFO] validating manifest of git repo https://github.com/Nu11ified/picochat on toolchain f9988fefd3add01f414f52b414308e7872622fee
[INFO] running `Command { std: CARGO_HOME="/workspace/cargo-home" RUSTUP_HOME="/workspace/rustup-home" "/workspace/cargo-home/bin/cargo" "+f9988fefd3add01f414f52b414308e7872622fee" "metadata" "--manifest-path" "Cargo.toml" "--no-deps", kill_on_drop: false }`
[INFO] crate git repo https://github.com/Nu11ified/picochat already has a lockfile, it will not be regenerated
[INFO] running `Command { std: CARGO_HOME="/workspace/cargo-home" RUSTUP_HOME="/workspace/rustup-home" "/workspace/cargo-home/bin/cargo" "+f9988fefd3add01f414f52b414308e7872622fee" "fetch" "--manifest-path" "Cargo.toml", kill_on_drop: false }`
[INFO] [stderr]     Updating crates.io index
[INFO] [stderr]  Downloading crates ...
[INFO] [stderr]   Downloaded safetensors v0.4.5
[INFO] [stderr]   Downloaded dyn-stack-macros v0.1.3
[INFO] [stderr]   Downloaded bindgen_cuda v0.1.6
[INFO] [stderr]   Downloaded ug-metal v0.1.0
[INFO] [stderr]   Downloaded dyn-stack v0.13.2
[INFO] [stderr]   Downloaded ug-cuda v0.1.0
[INFO] [stderr]   Downloaded candle-kernels v0.8.4
[INFO] [stderr]   Downloaded candle-nn v0.8.4
[INFO] [stderr]   Downloaded ug v0.1.0
[INFO] [stderr]   Downloaded zip v1.1.4
[INFO] [stderr]   Downloaded zerocopy-derive v0.8.40
[INFO] [stderr]   Downloaded fancy-regex v0.14.0
[INFO] [stderr]   Downloaded candle-metal-kernels v0.8.4
[INFO] [stderr]   Downloaded zerocopy v0.8.40
[INFO] [stderr]   Downloaded raw-cpuid v11.6.0
[INFO] [stderr]   Downloaded candle-core v0.8.4
[INFO] [stderr]   Downloaded cudarc v0.13.9
[INFO] running `Command { std: "docker" "create" "-v" "/var/lib/crater-agent-workspace/builds/worker-0-tc1/target:/opt/rustwide/target:rw,Z" "-v" "/var/lib/crater-agent-workspace/builds/worker-0-tc1/source:/opt/rustwide/workdir:ro,Z" "-v" "/var/lib/crater-agent-workspace/cargo-home:/opt/rustwide/cargo-home:ro,Z" "-v" "/var/lib/crater-agent-workspace/rustup-home:/opt/rustwide/rustup-home:ro,Z" "-e" "SOURCE_DIR=/opt/rustwide/workdir" "-e" "CARGO_TARGET_DIR=/opt/rustwide/target" "-e" "CARGO_HOME=/opt/rustwide/cargo-home" "-e" "RUSTUP_HOME=/opt/rustwide/rustup-home" "-w" "/opt/rustwide/workdir" "-m" "1610612736" "--user" "0:0" "--network" "none" "ghcr.io/rust-lang/crates-build-env/linux@sha256:d429b63d4308055ea97f60fb1d3dfca48854a00942f1bd2ad806beaf015945ec" "/opt/rustwide/cargo-home/bin/cargo" "+f9988fefd3add01f414f52b414308e7872622fee" "metadata" "--no-deps" "--format-version=1", kill_on_drop: false }`
[INFO] [stdout] d4b63651784911fdccf945d0992dffd181f0e71f4f1e50bcd0e21630a672b525
[INFO] running `Command { std: "docker" "start" "-a" "d4b63651784911fdccf945d0992dffd181f0e71f4f1e50bcd0e21630a672b525", kill_on_drop: false }`
[INFO] running `Command { std: "docker" "inspect" "d4b63651784911fdccf945d0992dffd181f0e71f4f1e50bcd0e21630a672b525", kill_on_drop: false }`
[INFO] running `Command { std: "docker" "rm" "-f" "d4b63651784911fdccf945d0992dffd181f0e71f4f1e50bcd0e21630a672b525", kill_on_drop: false }`
[INFO] [stdout] d4b63651784911fdccf945d0992dffd181f0e71f4f1e50bcd0e21630a672b525
[INFO] running `Command { std: "docker" "create" "-v" "/var/lib/crater-agent-workspace/builds/worker-0-tc1/target:/opt/rustwide/target:rw,Z" "-v" "/var/lib/crater-agent-workspace/builds/worker-0-tc1/source:/opt/rustwide/workdir:ro,Z" "-v" "/var/lib/crater-agent-workspace/cargo-home:/opt/rustwide/cargo-home:ro,Z" "-v" "/var/lib/crater-agent-workspace/rustup-home:/opt/rustwide/rustup-home:ro,Z" "-e" "SOURCE_DIR=/opt/rustwide/workdir" "-e" "CARGO_TARGET_DIR=/opt/rustwide/target" "-e" "CARGO_INCREMENTAL=0" "-e" "RUST_BACKTRACE=full" "-e" "RUSTFLAGS=--cap-lints=forbid" "-e" "RUSTDOCFLAGS=--cap-lints=forbid" "-e" "CARGO_HOME=/opt/rustwide/cargo-home" "-e" "RUSTUP_HOME=/opt/rustwide/rustup-home" "-w" "/opt/rustwide/workdir" "-m" "1610612736" "--user" "0:0" "--network" "none" "ghcr.io/rust-lang/crates-build-env/linux@sha256:d429b63d4308055ea97f60fb1d3dfca48854a00942f1bd2ad806beaf015945ec" "/opt/rustwide/cargo-home/bin/cargo" "+f9988fefd3add01f414f52b414308e7872622fee" "build" "--frozen" "--message-format=json", kill_on_drop: false }`
[INFO] [stdout] 09cf221ffd77906a4a566f5e3be976512f5915b2beb64c6fd4b6b28841278ffd
[INFO] running `Command { std: "docker" "start" "-a" "09cf221ffd77906a4a566f5e3be976512f5915b2beb64c6fd4b6b28841278ffd", kill_on_drop: false }`
[INFO] [stderr]    Compiling libc v0.2.182
[INFO] [stderr]    Compiling libm v0.2.16
[INFO] [stderr]    Compiling zerocopy v0.8.40
[INFO] [stderr]    Compiling num-traits v0.2.19
[INFO] [stderr]    Compiling getrandom v0.3.4
[INFO] [stderr]    Compiling quote v1.0.44
[INFO] [stderr]    Compiling crossbeam-utils v0.8.21
[INFO] [stderr]    Compiling reborrow v0.5.5
[INFO] [stderr]    Compiling rayon-core v1.13.0
[INFO] [stderr]    Compiling syn v2.0.117
[INFO] [stderr]    Compiling seq-macro v0.3.6
[INFO] [stderr]    Compiling either v1.15.0
[INFO] [stderr]    Compiling pulp v0.21.5
[INFO] [stderr]    Compiling bitflags v2.11.0
[INFO] [stderr]    Compiling dyn-stack-macros v0.1.3
[INFO] [stderr]    Compiling crossbeam-epoch v0.9.18
[INFO] [stderr]    Compiling raw-cpuid v11.6.0
[INFO] [stderr]    Compiling ahash v0.8.12
[INFO] [stderr]    Compiling raw-cpuid v10.7.0
[INFO] [stderr]    Compiling iana-time-zone v0.1.65
[INFO] [stderr]    Compiling crossbeam-deque v0.8.6
[INFO] [stderr]    Compiling arrow-schema v53.4.1
[INFO] [stderr]    Compiling tracing-core v0.1.36
[INFO] [stderr]    Compiling rayon v1.11.0
[INFO] [stderr]    Compiling rand_core v0.9.5
[INFO] [stderr]    Compiling hashbrown v0.15.5
[INFO] [stderr]    Compiling winnow v0.7.14
[INFO] [stderr]    Compiling hashbrown v0.16.1
[INFO] [stderr]    Compiling serde_json v1.0.149
[INFO] [stderr]    Compiling num-integer v0.1.46
[INFO] [stderr]    Compiling chrono v0.4.39
[INFO] [stderr]    Compiling num-bigint v0.4.6
[INFO] [stderr]    Compiling num-iter v0.1.45
[INFO] [stderr]    Compiling indexmap v2.13.0
[INFO] [stderr]    Compiling toml_datetime v0.7.5+spec-1.1.0
[INFO] [stderr]    Compiling lexical-util v1.0.7
[INFO] [stderr]    Compiling regex-syntax v0.8.10
[INFO] [stderr]    Compiling zip v1.1.4
[INFO] [stderr]    Compiling lexical-write-integer v1.0.6
[INFO] [stderr]    Compiling lexical-parse-integer v1.0.6
[INFO] [stderr]    Compiling semver v1.0.27
[INFO] [stderr]    Compiling num-rational v0.4.2
[INFO] [stderr]    Compiling toml_parser v1.0.9+spec-1.1.0
[INFO] [stderr]    Compiling parking_lot_core v0.9.12
[INFO] [stderr]    Compiling rustc_version v0.4.1
[INFO] [stderr]    Compiling lexical-write-float v1.0.6
[INFO] [stderr]    Compiling lexical-parse-float v1.0.6
[INFO] [stderr]    Compiling crc32fast v1.5.0
[INFO] [stderr]    Compiling anyhow v1.0.102
[INFO] [stderr]    Compiling memmap2 v0.9.10
[INFO] [stderr]    Compiling synstructure v0.13.2
[INFO] [stderr]    Compiling toml_edit v0.23.10+spec-1.0.0
[INFO] [stderr]    Compiling getrandom v0.2.17
[INFO] [stderr]    Compiling errno v0.3.14
[INFO] [stderr]    Compiling num_cpus v1.17.0
[INFO] [stderr]    Compiling libloading v0.8.9
[INFO] [stderr]    Compiling comfy-table v7.2.2
[INFO] [stderr]    Compiling rand_core v0.6.4
[INFO] [stderr]    Compiling signal-hook-registry v1.4.8
[INFO] [stderr]    Compiling lexical-core v1.0.6
[INFO] [stderr]    Compiling parking_lot v0.12.5
[INFO] [stderr]    Compiling proc-macro-crate v3.4.0
[INFO] [stderr]    Compiling flatbuffers v24.12.23
[INFO] [stderr]    Compiling atoi v2.0.0
[INFO] [stderr]    Compiling socket2 v0.6.2
[INFO] [stderr]    Compiling mio v1.1.1
[INFO] [stderr]    Compiling regex-automata v0.4.14
[INFO] [stderr]    Compiling http v1.4.0
[INFO] [stderr]    Compiling bit-vec v0.8.0
[INFO] [stderr]    Compiling slab v0.4.12
[INFO] [stderr]    Compiling snap v1.1.1
[INFO] [stderr]    Compiling zerocopy-derive v0.8.40
[INFO] [stderr]    Compiling bytemuck_derive v1.10.2
[INFO] [stderr]    Compiling serde_derive v1.0.228
[INFO] [stderr]    Compiling tracing-attributes v0.1.31
[INFO] [stderr]    Compiling thiserror-impl v1.0.69
[INFO] [stderr]    Compiling zerofrom-derive v0.1.6
[INFO] [stderr]    Compiling num_enum_derive v0.7.5
[INFO] [stderr]    Compiling bytemuck v1.25.0
[INFO] [stderr]    Compiling tracing v0.1.44
[INFO] [stderr]    Compiling yoke-derive v0.7.5
[INFO] [stderr]    Compiling thiserror v1.0.69
[INFO] [stderr]    Compiling num-complex v0.4.6
[INFO] [stderr]    Compiling dyn-stack v0.13.2
[INFO] [stderr]    Compiling num v0.4.3
[INFO] [stderr]    Compiling pulp v0.18.22
[INFO] [stderr]    Compiling dyn-stack v0.10.0
[INFO] [stderr]    Compiling zerofrom v0.1.6
[INFO] [stderr]    Compiling num_enum v0.7.5
[INFO] [stderr]    Compiling displaydoc v0.2.5
[INFO] [stderr]    Compiling tokio-macros v2.6.1
[INFO] [stderr]    Compiling yoke v0.7.5
[INFO] [stderr]    Compiling futures-macro v0.3.32
[INFO] [stderr]    Compiling tokio v1.49.0
[INFO] [stderr]    Compiling futures-util v0.3.32
[INFO] [stderr]    Compiling http-body v1.0.1
[INFO] [stderr]    Compiling bit-set v0.8.0
[INFO] [stderr]    Compiling regex v1.12.3
[INFO] [stderr]    Compiling fancy-regex v0.14.0
[INFO] [stderr]    Compiling serde v1.0.228
[INFO] [stderr]    Compiling ordered-float v2.10.1
[INFO] [stderr]    Compiling safetensors v0.4.5
[INFO] [stderr]    Compiling integer-encoding v3.0.4
[INFO] [stderr]    Compiling twox-hash v1.6.3
[INFO] [stderr]    Compiling thrift v0.17.0
[INFO] [stderr]    Compiling picochat-tokenizer v0.1.0 (/opt/rustwide/workdir/crates/picochat-tokenizer)
[INFO] [stderr]    Compiling unicase v2.9.0
[INFO] [stderr]    Compiling http-body-util v0.1.3
[INFO] [stderr]    Compiling mime_guess v2.0.5
[INFO] [stderr]    Compiling picochat-tool v0.1.0 (/opt/rustwide/workdir/crates/picochat-tool)
[INFO] [stderr]    Compiling async-trait v0.1.89
[INFO] [stderr]    Compiling anstyle-query v1.1.5
[INFO] [stderr]    Compiling anstream v0.6.21
[INFO] [stderr]    Compiling serde_urlencoded v0.7.1
[INFO] [stderr]    Compiling axum-macros v0.4.2
[INFO] [stderr]    Compiling serde_path_to_error v0.1.20
[INFO] [stderr]    Compiling ppv-lite86 v0.2.21
[INFO] [stderr]    Compiling futures-executor v0.3.32
[INFO] [stderr]    Compiling axum-core v0.4.5
[INFO] [stderr]    Compiling clap_lex v1.0.0
[INFO] [stderr]    Compiling clap_derive v4.5.55
[INFO] [stderr]    Compiling clap_builder v4.5.60
[INFO] [stderr]    Compiling rand_chacha v0.9.0
[INFO] [stderr]    Compiling rand_chacha v0.3.1
[INFO] [stderr]    Compiling futures v0.3.32
[INFO] [stderr]    Compiling rand v0.9.2
[INFO] [stderr]    Compiling rand v0.8.5
[INFO] [stderr]    Compiling rand_distr v0.5.1
[INFO] [stderr]    Compiling hyper v1.8.1
[INFO] [stderr]    Compiling tower v0.5.3
[INFO] [stderr]    Compiling tokio-util v0.7.18
[INFO] [stderr]    Compiling half v2.7.1
[INFO] [stderr]    Compiling tower-http v0.5.2
[INFO] [stderr]    Compiling gemm-common v0.18.2
[INFO] [stderr]    Compiling arrow-buffer v53.4.1
[INFO] [stderr]    Compiling gemm-common v0.17.1
[INFO] [stderr]    Compiling gemm-f32 v0.18.2
[INFO] [stderr]    Compiling gemm-c64 v0.18.2
[INFO] [stderr]    Compiling gemm-c32 v0.18.2
[INFO] [stderr]    Compiling gemm-f64 v0.18.2
[INFO] [stderr]    Compiling arrow-data v53.4.1
[INFO] [stderr]    Compiling gemm-f32 v0.17.1
[INFO] [stderr]    Compiling arrow-array v53.4.1
[INFO] [stderr]    Compiling gemm-f16 v0.18.2
[INFO] [stderr]    Compiling gemm v0.18.2
[INFO] [stderr]    Compiling gemm-f16 v0.17.1
[INFO] [stderr]    Compiling gemm-c64 v0.17.1
[INFO] [stderr]    Compiling gemm-f64 v0.17.1
[INFO] [stderr]    Compiling gemm-c32 v0.17.1
[INFO] [stderr]    Compiling ug v0.1.0
[INFO] [stderr]    Compiling gemm v0.17.1
[INFO] [stderr]    Compiling hyper-util v0.1.20
[INFO] [stderr]    Compiling arrow-select v53.4.1
[INFO] [stderr]    Compiling candle-core v0.8.4
[INFO] [stderr]    Compiling arrow-row v53.4.1
[INFO] [stderr]    Compiling arrow-arith v53.4.1
[INFO] [stderr]    Compiling axum v0.7.9
[INFO] [stderr]    Compiling arrow-cast v53.4.1
[INFO] [stderr]    Compiling arrow-string v53.4.1
[INFO] [stderr]    Compiling arrow-ord v53.4.1
[INFO] [stderr]    Compiling clap v4.5.60
[INFO] [stderr]    Compiling arrow-ipc v53.4.1
[INFO] [stderr]    Compiling arrow v53.4.1
[INFO] [stderr]    Compiling candle-nn v0.8.4
[INFO] [stderr]    Compiling parquet v53.4.1
[INFO] [stderr]    Compiling picochat-core v0.1.0 (/opt/rustwide/workdir/crates/picochat-core)
[INFO] [stderr]    Compiling picochat-engine v0.1.0 (/opt/rustwide/workdir/crates/picochat-engine)
[INFO] [stderr]    Compiling picochat-optim v0.1.0 (/opt/rustwide/workdir/crates/picochat-optim)
[INFO] [stderr]    Compiling picochat-data v0.1.0 (/opt/rustwide/workdir/crates/picochat-data)
[INFO] [stderr]    Compiling picochat-eval v0.1.0 (/opt/rustwide/workdir/crates/picochat-eval)
[INFO] [stderr]    Compiling picochat-train v0.1.0 (/opt/rustwide/workdir/crates/picochat-train)
[INFO] [stderr]    Compiling picochat-serve v0.1.0 (/opt/rustwide/workdir/crates/picochat-serve)
[INFO] [stderr]    Compiling picochat-cli v0.1.0 (/opt/rustwide/workdir/crates/picochat-cli)
[INFO] [stderr]     Finished `dev` profile [unoptimized + debuginfo] target(s) in 4m 30s
[INFO] running `Command { std: "docker" "inspect" "09cf221ffd77906a4a566f5e3be976512f5915b2beb64c6fd4b6b28841278ffd", kill_on_drop: false }`
[INFO] running `Command { std: "docker" "rm" "-f" "09cf221ffd77906a4a566f5e3be976512f5915b2beb64c6fd4b6b28841278ffd", kill_on_drop: false }`
[INFO] [stdout] 09cf221ffd77906a4a566f5e3be976512f5915b2beb64c6fd4b6b28841278ffd
[INFO] running `Command { std: "docker" "create" "-v" "/var/lib/crater-agent-workspace/builds/worker-0-tc1/target:/opt/rustwide/target:rw,Z" "-v" "/var/lib/crater-agent-workspace/builds/worker-0-tc1/source:/opt/rustwide/workdir:ro,Z" "-v" "/var/lib/crater-agent-workspace/cargo-home:/opt/rustwide/cargo-home:ro,Z" "-v" "/var/lib/crater-agent-workspace/rustup-home:/opt/rustwide/rustup-home:ro,Z" "-e" "SOURCE_DIR=/opt/rustwide/workdir" "-e" "CARGO_TARGET_DIR=/opt/rustwide/target" "-e" "CARGO_INCREMENTAL=0" "-e" "RUST_BACKTRACE=full" "-e" "RUSTFLAGS=--cap-lints=forbid" "-e" "RUSTDOCFLAGS=--cap-lints=forbid" "-e" "CARGO_HOME=/opt/rustwide/cargo-home" "-e" "RUSTUP_HOME=/opt/rustwide/rustup-home" "-w" "/opt/rustwide/workdir" "-m" "1610612736" "--user" "0:0" "--network" "none" "ghcr.io/rust-lang/crates-build-env/linux@sha256:d429b63d4308055ea97f60fb1d3dfca48854a00942f1bd2ad806beaf015945ec" "/opt/rustwide/cargo-home/bin/cargo" "+f9988fefd3add01f414f52b414308e7872622fee" "test" "--frozen" "--no-run" "--message-format=json", kill_on_drop: false }`
[INFO] [stdout] d21c13d6e783f223ffc01811b5a9b98569d761f0a2286687e111ff375313597c
[INFO] running `Command { std: "docker" "start" "-a" "d21c13d6e783f223ffc01811b5a9b98569d761f0a2286687e111ff375313597c", kill_on_drop: false }`
[INFO] [stderr]    Compiling picochat-serve v0.1.0 (/opt/rustwide/workdir/crates/picochat-serve)
[INFO] [stderr]    Compiling picochat-train v0.1.0 (/opt/rustwide/workdir/crates/picochat-train)
[INFO] [stderr]    Compiling picochat-cli v0.1.0 (/opt/rustwide/workdir/crates/picochat-cli)
[INFO] [stderr]    Compiling picochat-eval v0.1.0 (/opt/rustwide/workdir/crates/picochat-eval)
[INFO] [stderr]    Compiling picochat-data v0.1.0 (/opt/rustwide/workdir/crates/picochat-data)
[INFO] [stderr]    Compiling picochat-optim v0.1.0 (/opt/rustwide/workdir/crates/picochat-optim)
[INFO] [stderr]    Compiling picochat-engine v0.1.0 (/opt/rustwide/workdir/crates/picochat-engine)
[INFO] [stderr]    Compiling picochat-tool v0.1.0 (/opt/rustwide/workdir/crates/picochat-tool)
[INFO] [stderr]    Compiling picochat-tokenizer v0.1.0 (/opt/rustwide/workdir/crates/picochat-tokenizer)
[INFO] [stderr]    Compiling picochat-core v0.1.0 (/opt/rustwide/workdir/crates/picochat-core)
[INFO] [stderr]     Finished `test` profile [unoptimized + debuginfo] target(s) in 45.11s
[INFO] running `Command { std: "docker" "inspect" "d21c13d6e783f223ffc01811b5a9b98569d761f0a2286687e111ff375313597c", kill_on_drop: false }`
[INFO] running `Command { std: "docker" "rm" "-f" "d21c13d6e783f223ffc01811b5a9b98569d761f0a2286687e111ff375313597c", kill_on_drop: false }`
[INFO] [stdout] d21c13d6e783f223ffc01811b5a9b98569d761f0a2286687e111ff375313597c
[INFO] running `Command { std: "docker" "create" "-v" "/var/lib/crater-agent-workspace/builds/worker-0-tc1/target:/opt/rustwide/target:rw,Z" "-v" "/var/lib/crater-agent-workspace/builds/worker-0-tc1/source:/opt/rustwide/workdir:ro,Z" "-v" "/var/lib/crater-agent-workspace/cargo-home:/opt/rustwide/cargo-home:ro,Z" "-v" "/var/lib/crater-agent-workspace/rustup-home:/opt/rustwide/rustup-home:ro,Z" "-e" "SOURCE_DIR=/opt/rustwide/workdir" "-e" "CARGO_TARGET_DIR=/opt/rustwide/target" "-e" "CARGO_INCREMENTAL=0" "-e" "RUST_BACKTRACE=full" "-e" "RUSTFLAGS=--cap-lints=forbid" "-e" "RUSTDOCFLAGS=--cap-lints=forbid" "-e" "CARGO_HOME=/opt/rustwide/cargo-home" "-e" "RUSTUP_HOME=/opt/rustwide/rustup-home" "-w" "/opt/rustwide/workdir" "-m" "1610612736" "--user" "0:0" "--network" "none" "ghcr.io/rust-lang/crates-build-env/linux@sha256:d429b63d4308055ea97f60fb1d3dfca48854a00942f1bd2ad806beaf015945ec" "/opt/rustwide/cargo-home/bin/cargo" "+f9988fefd3add01f414f52b414308e7872622fee" "test" "--frozen", kill_on_drop: false }`
[INFO] [stdout] 4e72ac9e64b4e8a3cf48c46c5d85e82ac7c3b5820107818548538845b2920f27
[INFO] running `Command { std: "docker" "start" "-a" "4e72ac9e64b4e8a3cf48c46c5d85e82ac7c3b5820107818548538845b2920f27", kill_on_drop: false }`
[INFO] [stderr]     Finished `test` profile [unoptimized + debuginfo] target(s) in 0.59s
[INFO] [stderr]      Running unittests src/main.rs (/opt/rustwide/target/debug/deps/picochat-cd3e9104f76ddbae)
[INFO] [stdout] 
[INFO] [stdout] running 0 tests
[INFO] [stdout] 
[INFO] [stdout] test result: ok. 0 passed; 0 failed; 0 ignored; 0 measured; 0 filtered out; finished in 0.00s
[INFO] [stdout] 
[INFO] [stderr]      Running unittests src/lib.rs (/opt/rustwide/target/debug/deps/picochat_core-0b3814daca6b94e1)
[INFO] [stdout] 
[INFO] [stdout] running 0 tests
[INFO] [stdout] 
[INFO] [stdout] test result: ok. 0 passed; 0 failed; 0 ignored; 0 measured; 0 filtered out; finished in 0.00s
[INFO] [stdout] 
[INFO] [stderr]      Running tests/attention_test.rs (/opt/rustwide/target/debug/deps/attention_test-815f868a3972e636)
[INFO] [stdout] 
[INFO] [stdout] running 2 tests
[INFO] [stdout] test test_attention_output_shape ... ok
[INFO] [stdout] test test_attention_causal_masking ... ok
[INFO] [stdout] 
[INFO] [stdout] test result: ok. 2 passed; 0 failed; 0 ignored; 0 measured; 0 filtered out; finished in 0.88s
[INFO] [stdout] 
[INFO] [stderr]      Running tests/config_test.rs (/opt/rustwide/target/debug/deps/config_test-1e41f886361c0fe8)
[INFO] [stdout] 
[INFO] [stdout] running 6 tests
[INFO] [stdout] test test_depth_12_config ... ok
[INFO] [stdout] test test_head_dim_consistent ... ok
[INFO] [stdout] test test_depth_26_gpt2_config ... ok
[INFO] [stdout] test test_depth_4_small_config ... ok
[INFO] [stdout] test test_window_sizes ... ok
[INFO] [stderr]      Running tests/init_test.rs (/opt/rustwide/target/debug/deps/init_test-526cb448f14b0673)
[INFO] [stdout] test test_padded_vocab_size ... ok
[INFO] [stdout] 
[INFO] [stdout] test result: ok. 6 passed; 0 failed; 0 ignored; 0 measured; 0 filtered out; finished in 0.01s
[INFO] [stdout] 
[INFO] [stdout] 
[INFO] [stdout] running 7 tests
[INFO] [stdout] test test_c_proj_init_to_zero has been running for over 60 seconds
[INFO] [stdout] test test_lm_head_init_narrow_normal has been running for over 60 seconds
[INFO] [stdout] test test_resid_lambdas_init_to_one has been running for over 60 seconds
[INFO] [stdout] test test_uniform_weights_in_range has been running for over 60 seconds
[INFO] [stdout] test test_ve_gate_init_to_zero has been running for over 60 seconds
[INFO] [stdout] test test_wte_init_normal has been running for over 60 seconds
[INFO] [stdout] test test_x0_lambdas_init_to_point_one has been running for over 60 seconds
[INFO] [stdout] test test_x0_lambdas_init_to_point_one ... ok
[INFO] [stdout] test test_ve_gate_init_to_zero ... ok
[INFO] [stdout] test test_wte_init_normal ... ok
[INFO] [stdout] test test_uniform_weights_in_range ... ok
[INFO] [stdout] test test_resid_lambdas_init_to_one ... ok
[INFO] [stdout] test test_lm_head_init_narrow_normal ... ok
[INFO] [stdout] test test_c_proj_init_to_zero ... ok
[INFO] [stdout] 
[INFO] [stdout] test result: ok. 7 passed; 0 failed; 0 ignored; 0 measured; 0 filtered out; finished in 224.53s
[INFO] [stdout] 
[INFO] [stderr]      Running tests/kv_cache_test.rs (/opt/rustwide/target/debug/deps/kv_cache_test-5ea21a140cbb3363)
[INFO] [stdout] 
[INFO] [stdout] running 6 tests
[INFO] [stdout] test test_kv_cache_new ... ok
[INFO] [stdout] test test_kv_cache_reset ... ok
[INFO] [stdout] test test_layer_cache_update ... ok
[INFO] [stdout] test test_training_forward_unchanged ... ok
[INFO] [stdout] test test_forward_with_cache_prefill ... ok
[INFO] [stdout] test test_forward_with_cache_decode ... ok
[INFO] [stdout] 
[INFO] [stdout] test result: ok. 6 passed; 0 failed; 0 ignored; 0 measured; 0 filtered out; finished in 13.21s
[INFO] [stdout] 
[INFO] [stderr]      Running tests/mlp_test.rs (/opt/rustwide/target/debug/deps/mlp_test-bffd0b1a86e23dcc)
[INFO] [stdout] 
[INFO] [stdout] running 2 tests
[INFO] [stdout] test test_mlp_output_shape ... ok
[INFO] [stdout] test test_mlp_relu_squared_activation ... ok
[INFO] [stdout] 
[INFO] [stdout] test result: ok. 2 passed; 0 failed; 0 ignored; 0 measured; 0 filtered out; finished in 0.09s
[INFO] [stdout] 
[INFO] [stderr]      Running tests/model_test.rs (/opt/rustwide/target/debug/deps/model_test-0ec629e8a74a3d5d)
[INFO] [stdout] 
[INFO] [stdout] running 3 tests
[INFO] [stdout] test test_gpt_depth4_small ... ok
[INFO] [stdout] test test_gpt_forward_with_targets ... ok
[INFO] [stdout] test test_gpt_forward_logits_shape ... ok
[INFO] [stdout] 
[INFO] [stdout] test result: ok. 3 passed; 0 failed; 0 ignored; 0 measured; 0 filtered out; finished in 32.45s
[INFO] [stdout] 
[INFO] [stderr]      Running tests/norm_test.rs (/opt/rustwide/target/debug/deps/norm_test-c1a3930a6f4d6524)
[INFO] [stdout] 
[INFO] [stdout] running 2 tests
[INFO] [stdout] test test_rms_norm_shape_preserved ... ok
[INFO] [stdout] test test_rms_norm_unit_rms ... ok
[INFO] [stdout] 
[INFO] [stdout] test result: ok. 2 passed; 0 failed; 0 ignored; 0 measured; 0 filtered out; finished in 0.01s
[INFO] [stdout] 
[INFO] [stderr]      Running tests/rotary_test.rs (/opt/rustwide/target/debug/deps/rotary_test-825bd5cc0e2b0f0d)
[INFO] [stdout] 
[INFO] [stdout] running 3 tests
[INFO] [stdout] test test_rotary_precompute_shapes ... ok
[INFO] [stdout] test test_rotary_offset_for_kv_cache ... ok
[INFO] [stdout] test test_apply_rotary_emb_shape ... ok
[INFO] [stdout] 
[INFO] [stdout] test result: ok. 3 passed; 0 failed; 0 ignored; 0 measured; 0 filtered out; finished in 0.10s
[INFO] [stdout] 
[INFO] [stdout] 
[INFO] [stdout] running 0 tests
[INFO] [stdout] 
[INFO] [stdout] test result: ok. 0 passed; 0 failed; 0 ignored; 0 measured; 0 filtered out; finished in 0.00s
[INFO] [stdout] 
[INFO] [stderr]      Running unittests src/lib.rs (/opt/rustwide/target/debug/deps/picochat_data-0a6007649e98cf8c)
[INFO] [stderr]      Running tests/arc_test.rs (/opt/rustwide/target/debug/deps/arc_test-993f6be372a9a4d7)
[INFO] [stdout] 
[INFO] [stdout] running 4 tests
[INFO] [stdout] test test_arc_question_answer_index ... ok
[INFO] [stdout] test test_format_arc_prompt ... ok
[INFO] [stdout] test test_arc_question_parse ... ok
[INFO] [stdout] test test_load_arc_from_string ... ok
[INFO] [stdout] 
[INFO] [stdout] test result: ok. 4 passed; 0 failed; 0 ignored; 0 measured; 0 filtered out; finished in 0.00s
[INFO] [stdout] 
[INFO] [stderr]      Running tests/dataloader_test.rs (/opt/rustwide/target/debug/deps/dataloader_test-b51826a9859cd8d1)
[INFO] [stdout] 
[INFO] [stdout] running 10 tests
[INFO] [stdout] test test_dataset_len ... ok
[INFO] [stdout] test test_packing_long_document_splits ... ok
[INFO] [stdout] test test_packing_batch_returns_none_when_insufficient ... ok
[INFO] [stdout] test test_packing_bos_prepended ... ok
[INFO] [stdout] test test_packing_single_document ... ok
[INFO] [stdout] test test_packing_flush_pads ... ok
[INFO] [stdout] test test_dataset_empty ... ok
[INFO] [stdout] test test_packing_target_shift ... ok
[INFO] [stdout] test test_dataloader_batch_shape ... ok
[INFO] [stdout] test test_dataloader_target_is_shifted_input ... ok
[INFO] [stdout] 
[INFO] [stdout] test result: ok. 10 passed; 0 failed; 0 ignored; 0 measured; 0 filtered out; finished in 0.00s
[INFO] [stdout] 
[INFO] [stderr]      Running tests/mixture_test.rs (/opt/rustwide/target/debug/deps/mixture_test-a06c9e6c26694643)
[INFO] [stdout] 
[INFO] [stdout] running 3 tests
[INFO] [stdout] test test_mixture_epoch_cycling ... ok
[INFO] [stdout] test test_mixture_single_dataset ... ok
[INFO] [stderr]      Running tests/parquet_test.rs (/opt/rustwide/target/debug/deps/parquet_test-7956c0881413673c)
[INFO] [stdout] test test_mixture_weighted_sampling ... ok
[INFO] [stdout] 
[INFO] [stdout] test result: ok. 3 passed; 0 failed; 0 ignored; 0 measured; 0 filtered out; finished in 0.04s
[INFO] [stdout] 
[INFO] [stdout] 
[INFO] [stdout] running 3 tests
[INFO] [stdout] test test_read_parquet_texts ... ok
[INFO] [stdout] test test_read_all_text ... ok
[INFO] [stdout] test test_missing_column_error ... ok
[INFO] [stdout] 
[INFO] [stdout] test result: ok. 3 passed; 0 failed; 0 ignored; 0 measured; 0 filtered out; finished in 0.04s
[INFO] [stdout] 
[INFO] [stderr]      Running tests/sft_test.rs (/opt/rustwide/target/debug/deps/sft_test-6b654673d479c473)
[INFO] [stdout] 
[INFO] [stdout] running 3 tests
[INFO] [stdout] test test_mask_alignment_multi_turn ... ok
[INFO] [stdout] test test_mask_alignment_single_turn ... ok
[INFO] [stdout] test test_chat_message_parse ... ok
[INFO] [stdout] 
[INFO] [stdout] test result: ok. 3 passed; 0 failed; 0 ignored; 0 measured; 0 filtered out; finished in 0.01s
[INFO] [stdout] 
[INFO] [stderr]      Running tests/tool_data_test.rs (/opt/rustwide/target/debug/deps/tool_data_test-b96fb69a019cdc5e)
[INFO] [stdout] 
[INFO] [stdout] running 4 tests
[INFO] [stdout] test test_format_tool_prompt ... ok
[INFO] [stdout] test test_tool_scenario_parse ... ok
[INFO] [stdout] test test_load_tool_scenarios_from_string ... ok
[INFO] [stdout] test test_tool_scenario_no_tool ... ok
[INFO] [stdout] 
[INFO] [stdout] test result: ok. 4 passed; 0 failed; 0 ignored; 0 measured; 0 filtered out; finished in 0.00s
[INFO] [stdout] 
[INFO] [stderr]      Running unittests src/lib.rs (/opt/rustwide/target/debug/deps/picochat_engine-0c9bdc3a6b467510)
[INFO] [stdout] 
[INFO] [stdout] running 0 tests
[INFO] [stdout] 
[INFO] [stdout] test result: ok. 0 passed; 0 failed; 0 ignored; 0 measured; 0 filtered out; finished in 0.00s
[INFO] [stdout] 
[INFO] [stderr]      Running tests/generate_logprobs_test.rs (/opt/rustwide/target/debug/deps/generate_logprobs_test-93b7603c2a9e5716)
[INFO] [stdout] 
[INFO] [stdout] running 3 tests
[INFO] [stdout] test test_logprobs_greedy_deterministic ... ok
[INFO] [stdout] test test_logprobs_with_stop_token ... ok
[INFO] [stdout] test test_logprobs_returns_ids_and_probs ... ok
[INFO] [stdout] 
[INFO] [stdout] test result: ok. 3 passed; 0 failed; 0 ignored; 0 measured; 0 filtered out; finished in 15.61s
[INFO] [stdout] 
[INFO] [stderr]      Running tests/generate_test.rs (/opt/rustwide/target/debug/deps/generate_test-68b9d41dfc5a466c)
[INFO] [stdout] 
[INFO] [stdout] running 4 tests
[INFO] [stdout] test test_generate_stops_at_max_tokens ... ok
[INFO] [stdout] test test_generate_stops_at_stop_token ... ok
[INFO] [stdout] test test_generate_produces_tokens ... ok
[INFO] [stdout] test test_greedy_is_deterministic ... ok
[INFO] [stdout] 
[INFO] [stdout] test result: ok. 4 passed; 0 failed; 0 ignored; 0 measured; 0 filtered out; finished in 17.40s
[INFO] [stdout] 
[INFO] [stderr]      Running tests/quantize_test.rs (/opt/rustwide/target/debug/deps/quantize_test-6666c54b7a32ba48)
[INFO] [stdout] 
[INFO] [stdout] running 4 tests
[INFO] [stdout] test test_scales_shape ... ok
[INFO] [stdout] test test_quantize_zeros ... ok
[INFO] [stdout] test test_quantize_large_values ... ok
[INFO] [stdout] test test_quantize_roundtrip ... ok
[INFO] [stdout] 
[INFO] [stdout] test result: ok. 4 passed; 0 failed; 0 ignored; 0 measured; 0 filtered out; finished in 0.00s
[INFO] [stdout] 
[INFO] [stderr]      Running tests/reasoning_test.rs (/opt/rustwide/target/debug/deps/reasoning_test-26830716ca69bc47)
[INFO] [stdout] 
[INFO] [stdout] running 3 tests
[INFO] [stderr]      Running tests/sampling_test.rs (/opt/rustwide/target/debug/deps/sampling_test-7db45d826d219b4a)
[INFO] [stdout] test test_segment_text_extraction ... ok
[INFO] [stdout] test test_output_segment_equality ... ok
[INFO] [stdout] test test_output_segment_variants ... ok
[INFO] [stdout] 
[INFO] [stdout] test result: ok. 3 passed; 0 failed; 0 ignored; 0 measured; 0 filtered out; finished in 0.00s
[INFO] [stdout] 
[INFO] [stdout] 
[INFO] [stdout] running 5 tests
[INFO] [stdout] test test_default_params ... ok
[INFO] [stdout] test test_greedy_with_zero_temperature ... ok
[INFO] [stdout] test test_top_k_limits_candidates ... ok
[INFO] [stdout] test test_greedy_returns_argmax ... ok
[INFO] [stdout] test test_sample_respects_distribution ... ok
[INFO] [stdout] 
[INFO] [stdout] test result: ok. 5 passed; 0 failed; 0 ignored; 0 measured; 0 filtered out; finished in 0.01s
[INFO] [stdout] 
[INFO] [stderr]      Running unittests src/lib.rs (/opt/rustwide/target/debug/deps/picochat_eval-8bf64001bee0220a)
[INFO] [stdout] 
[INFO] [stdout] running 0 tests
[INFO] [stdout] 
[INFO] [stderr]      Running tests/arc_eval_test.rs (/opt/rustwide/target/debug/deps/arc_eval_test-5def1f8bc72222d0)
[INFO] [stdout] test result: ok. 0 passed; 0 failed; 0 ignored; 0 measured; 0 filtered out; finished in 0.00s
[INFO] [stdout] 
[INFO] [stdout] 
[INFO] [stdout] running 1 test
[INFO] [stdout] test test_arc_result_accuracy ... ok
[INFO] [stdout] 
[INFO] [stdout] test result: ok. 1 passed; 0 failed; 0 ignored; 0 measured; 0 filtered out; finished in 0.00s
[INFO] [stdout] 
[INFO] [stderr]      Running tests/bpb_test.rs (/opt/rustwide/target/debug/deps/bpb_test-b96ae2826abe0a76)
[INFO] [stdout] 
[INFO] [stdout] running 2 tests
[INFO] [stdout] test test_bpb_formula ... ok
[INFO] [stdout] test test_bpb_result_fields ... ok
[INFO] [stdout] 
[INFO] [stdout] test result: ok. 2 passed; 0 failed; 0 ignored; 0 measured; 0 filtered out; finished in 0.00s
[INFO] [stdout] 
[INFO] [stderr]      Running tests/gsm8k_test.rs (/opt/rustwide/target/debug/deps/gsm8k_test-2d481d127123f6b1)
[INFO] [stdout] 
[INFO] [stdout] running 6 tests
[INFO] [stdout] test test_format_gsm_prompt ... ok
[INFO] [stdout] test test_extract_answer_none ... ok
[INFO] [stdout] test test_extract_answer_negative ... ok
[INFO] [stdout] test test_extract_answer_decimal ... ok
[INFO] [stdout] test test_extract_answer_with_comma ... ok
[INFO] [stderr]      Running tests/mmlu_test.rs (/opt/rustwide/target/debug/deps/mmlu_test-c3228bc234b15ac7)
[INFO] [stdout] test test_extract_answer_basic ... ok
[INFO] [stdout] 
[INFO] [stdout] test result: ok. 6 passed; 0 failed; 0 ignored; 0 measured; 0 filtered out; finished in 0.00s
[INFO] [stdout] 
[INFO] [stdout] 
[INFO] [stdout] running 3 tests
[INFO] [stdout] test test_format_mmlu_prompt ... ok
[INFO] [stdout] test test_pick_answer_from_logprobs ... ok
[INFO] [stdout] test test_pick_answer_tie_favors_first ... ok
[INFO] [stdout] 
[INFO] [stdout] test result: ok. 3 passed; 0 failed; 0 ignored; 0 measured; 0 filtered out; finished in 0.00s
[INFO] [stdout] 
[INFO] [stderr]      Running tests/reasoning_eval_test.rs (/opt/rustwide/target/debug/deps/reasoning_eval_test-4bd2cf79aa22aec0)
[INFO] [stdout] 
[INFO] [stdout] running 5 tests
[INFO] [stdout] test test_multiple_think_blocks ... ok
[INFO] [stdout] test test_reasoning_metrics_no_thinking ... ok
[INFO] [stdout] test test_reasoning_metrics_empty ... ok
[INFO] [stdout] test test_self_correction_patterns ... ok
[INFO] [stderr]      Running unittests src/lib.rs (/opt/rustwide/target/debug/deps/picochat_optim-f303d4af84202780)
[INFO] [stdout] test test_reasoning_metrics_with_thinking ... ok
[INFO] [stdout] 
[INFO] [stdout] test result: ok. 5 passed; 0 failed; 0 ignored; 0 measured; 0 filtered out; finished in 0.00s
[INFO] [stdout] 
[INFO] [stdout] 
[INFO] [stdout] running 0 tests
[INFO] [stdout] 
[INFO] [stdout] test result: ok. 0 passed; 0 failed; 0 ignored; 0 measured; 0 filtered out; finished in 0.00s
[INFO] [stdout] 
[INFO] [stderr]      Running tests/adamw_test.rs (/opt/rustwide/target/debug/deps/adamw_test-21b864b0aa753f11)
[INFO] [stdout] 
[INFO] [stdout] running 3 tests
[INFO] [stdout] test test_adamw_weight_decay ... ok
[INFO] [stdout] test test_adamw_single_step_changes_params ... ok
[INFO] [stdout] test test_adamw_multiple_steps_converge ... ok
[INFO] [stdout] 
[INFO] [stdout] test result: ok. 3 passed; 0 failed; 0 ignored; 0 measured; 0 filtered out; finished in 0.01s
[INFO] [stdout] 
[INFO] [stderr]      Running tests/combined_test.rs (/opt/rustwide/target/debug/deps/combined_test-d32af6f82401afc2)
[INFO] [stdout] 
[INFO] [stdout] running 5 tests
[INFO] [stdout] test test_classify_params_by_name ... ok
[INFO] [stdout] test test_scaling_with_different_n_embd ... ok
[INFO] [stdout] test test_from_varmap_classifies_correctly ... ok
[INFO] [stdout] test test_combined_step_updates_all_params ... ok
[INFO] [stdout] test test_combined_with_lr_multiplier ... ok
[INFO] [stdout] 
[INFO] [stdout] test result: ok. 5 passed; 0 failed; 0 ignored; 0 measured; 0 filtered out; finished in 0.03s
[INFO] [stdout] 
[INFO] [stderr]      Running tests/muon_test.rs (/opt/rustwide/target/debug/deps/muon_test-d9c94ae95aa0f0e0)
[INFO] [stdout] 
[INFO] [stdout] running 4 tests
[INFO] [stdout] test test_polar_express_tall_matrix ... ok
[INFO] [stdout] test test_polar_express_near_orthogonal ... ok
[INFO] [stdout] test test_muon_single_step ... ok
[INFO] [stdout] test test_muon_momentum_accumulates ... ok
[INFO] [stdout] 
[INFO] [stdout] test result: ok. 4 passed; 0 failed; 0 ignored; 0 measured; 0 filtered out; finished in 0.03s
[INFO] [stdout] 
[INFO] [stderr]      Running tests/schedule_test.rs (/opt/rustwide/target/debug/deps/schedule_test-3d18a4ac26bdfe8b)
[INFO] [stdout] 
[INFO] [stdout] running 5 tests
[INFO] [stdout] test test_warmdown_is_cosine ... ok
[INFO] [stdout] test test_constant_phase ... ok
[INFO] [stdout] test test_warmup_reaches_base_lr ... ok
[INFO] [stdout] test test_warmup_starts_at_zero ... ok
[INFO] [stdout] test test_warmdown_ends_at_zero ... ok
[INFO] [stdout] 
[INFO] [stdout] test result: ok. 5 passed; 0 failed; 0 ignored; 0 measured; 0 filtered out; finished in 0.01s
[INFO] [stdout] 
[INFO] [stderr]      Running unittests src/lib.rs (/opt/rustwide/target/debug/deps/picochat_serve-06a75c09b4f9dfb9)
[INFO] [stdout] 
[INFO] [stdout] running 0 tests
[INFO] [stdout] 
[INFO] [stdout] test result: ok. 0 passed; 0 failed; 0 ignored; 0 measured; 0 filtered out; finished in 0.00s
[INFO] [stdout] 
[INFO] [stderr]      Running tests/serve_test.rs (/opt/rustwide/target/debug/deps/serve_test-88a82d3cf0d037e4)
[INFO] [stdout] 
[INFO] [stdout] running 2 tests
[INFO] [stdout] test test_segment_to_type_mapping ... ok
[INFO] [stdout] test test_sse_payload_serialization ... ok
[INFO] [stdout] 
[INFO] [stdout] test result: ok. 2 passed; 0 failed; 0 ignored; 0 measured; 0 filtered out; finished in 0.00s
[INFO] [stdout] 
[INFO] [stderr]      Running unittests src/lib.rs (/opt/rustwide/target/debug/deps/picochat_tokenizer-1d6444337e17f6d9)
[INFO] [stderr]      Running tests/bpe_test.rs (/opt/rustwide/target/debug/deps/bpe_test-82efdbe3d9c17d48)
[INFO] [stdout] 
[INFO] [stdout] running 0 tests
[INFO] [stdout] 
[INFO] [stdout] test result: ok. 0 passed; 0 failed; 0 ignored; 0 measured; 0 filtered out; finished in 0.00s
[INFO] [stdout] 
[INFO] [stdout] 
[INFO] [stdout] running 6 tests
[INFO] [stdout] test test_gpt4_pattern_compiles ... ok
[INFO] [stdout] test test_train_merges_count ... ok
[INFO] [stdout] test test_merge_vocab_concatenation ... ok
[INFO] [stdout] test test_train_small_vocab ... ok
[INFO] [stdout] test test_vocab_has_byte_tokens ... ok
[INFO] [stdout] test test_pattern_splits_contractions ... ok
[INFO] [stdout] 
[INFO] [stdout] test result: ok. 6 passed; 0 failed; 0 ignored; 0 measured; 0 filtered out; finished in 0.09s
[INFO] [stdout] 
[INFO] [stderr]      Running tests/encode_test.rs (/opt/rustwide/target/debug/deps/encode_test-9d6f38858449c6cb)
[INFO] [stdout] 
[INFO] [stdout] running 8 tests
[INFO] [stdout] test test_encode_single_byte ... ok
[INFO] [stdout] test test_encode_decode_roundtrip ... ok
[INFO] [stdout] test test_encode_special_tokens ... ok
[INFO] [stdout] test test_unicode_roundtrip ... ok
[INFO] [stdout] test test_encode_empty_string ... ok
[INFO] [stdout] test test_adjacent_special_tokens ... ok
[INFO] [stdout] test test_encode_reduces_token_count ... ok
[INFO] [stdout] test test_save_load_roundtrip ... ok
[INFO] [stdout] 
[INFO] [stdout] test result: ok. 8 passed; 0 failed; 0 ignored; 0 measured; 0 filtered out; finished in 0.24s
[INFO] [stdout] 
[INFO] [stderr]      Running tests/special_test.rs (/opt/rustwide/target/debug/deps/special_test-3bb176ee0ecbea56)
[INFO] [stderr]      Running unittests src/lib.rs (/opt/rustwide/target/debug/deps/picochat_tool-ec0ee334e9f4fb57)
[INFO] [stdout] 
[INFO] [stdout] running 6 tests
[INFO] [stdout] test test_all_strings_unique ... ok
[INFO] [stdout] test test_registry_ids_at_end_of_vocab ... ok
[INFO] [stdout] test test_special_token_count ... ok
[INFO] [stdout] test test_special_token_roundtrip_str ... ok
[INFO] [stdout] test test_registry_roundtrip_id ... ok
[INFO] [stdout] test test_registry_non_special_id_returns_none ... ok
[INFO] [stdout] 
[INFO] [stdout] test result: ok. 6 passed; 0 failed; 0 ignored; 0 measured; 0 filtered out; finished in 0.00s
[INFO] [stdout] 
[INFO] [stdout] 
[INFO] [stdout] running 0 tests
[INFO] [stdout] 
[INFO] [stdout] test result: ok. 0 passed; 0 failed; 0 ignored; 0 measured; 0 filtered out; finished in 0.00s
[INFO] [stdout] 
[INFO] [stderr]      Running tests/ast_test.rs (/opt/rustwide/target/debug/deps/ast_test-16a48a4c1aecf1ba)
[INFO] [stdout] 
[INFO] [stdout] running 13 tests
[INFO] [stdout] test test_tokenize_comparison ... ok
[INFO] [stdout] test test_parse_operator_precedence ... ok
[INFO] [stdout] test test_parse_power ... ok
[INFO] [stdout] test test_tokenize_arithmetic ... ok
[INFO] [stdout] test test_parse_method_call ... ok
[INFO] [stdout] test test_parse_parenthesized ... ok
[INFO] [stdout] test test_parse_binary_arithmetic ... ok
[INFO] [stdout] test test_parse_function_call ... ok
[INFO] [stdout] test test_tokenize_function_call ... ok
[INFO] [stdout] test test_tokenize_negative_number ... ok
[INFO] [stdout] test test_tokenize_method_call ... ok
[INFO] [stdout] test test_tokenize_number ... ok
[INFO] [stdout] test test_tokenize_string ... ok
[INFO] [stdout] 
[INFO] [stdout] test result: ok. 13 passed; 0 failed; 0 ignored; 0 measured; 0 filtered out; finished in 0.01s
[INFO] [stdout] 
[INFO] [stderr]      Running tests/evaluator_test.rs (/opt/rustwide/target/debug/deps/evaluator_test-9ce7d299401acd02)
[INFO] [stdout] 
[INFO] [stdout] running 13 tests
[INFO] [stdout] test test_comparisons ... ok
[INFO] [stdout] test test_parse_error ... ok
[INFO] [stdout] test test_string_count ... ok
[INFO] [stdout] test test_division_by_zero ... ok
[INFO] [stdout] test test_string_len ... ok
[INFO] [stdout] test test_basic_arithmetic ... ok
[INFO] [stdout] test test_string_upper_lower ... ok
[INFO] [stdout] test test_integer_display ... ok
[INFO] [stderr]      Running unittests src/lib.rs (/opt/rustwide/target/debug/deps/picochat_train-8a230e4aff520307)
[INFO] [stdout] test test_math_functions ... ok
[INFO] [stdout] test test_operator_precedence ... ok
[INFO] [stdout] test test_power ... ok
[INFO] [stdout] test test_unary_minus ... ok
[INFO] [stdout] test test_unknown_function ... ok
[INFO] [stdout] 
[INFO] [stdout] test result: ok. 13 passed; 0 failed; 0 ignored; 0 measured; 0 filtered out; finished in 0.00s
[INFO] [stdout] 
[INFO] [stdout] 
[INFO] [stdout] running 0 tests
[INFO] [stdout] 
[INFO] [stdout] test result: ok. 0 passed; 0 failed; 0 ignored; 0 measured; 0 filtered out; finished in 0.00s
[INFO] [stderr]      Running tests/checkpoint_test.rs (/opt/rustwide/target/debug/deps/checkpoint_test-c45201fda158a60a)
[INFO] [stdout] 
[INFO] [stdout] 
[INFO] [stdout] running 2 tests
[INFO] [stdout] test test_save_and_load_config ... ok
[INFO] [stdout] test test_save_and_load_roundtrip ... ok
[INFO] [stdout] 
[INFO] [stdout] test result: ok. 2 passed; 0 failed; 0 ignored; 0 measured; 0 filtered out; finished in 54.15s
[INFO] [stdout] 
[INFO] [stderr]      Running tests/grpo_test.rs (/opt/rustwide/target/debug/deps/grpo_test-800fd32bd5207b25)
[INFO] [stdout] 
[INFO] [stdout] running 5 tests
[INFO] [stdout] test test_compute_clipped_objective ... ok
[INFO] [stdout] test test_grpo_config_defaults ... ok
[INFO] [stdout] test test_compute_kl_penalty ... ok
[INFO] [stdout] test test_normalize_advantages ... ok
[INFO] [stdout] test test_normalize_advantages_all_same ... ok
[INFO] [stdout] 
[INFO] [stdout] test result: ok. 5 passed; 0 failed; 0 ignored; 0 measured; 0 filtered out; finished in 0.00s
[INFO] [stdout] 
[INFO] [stderr]      Running tests/metrics_test.rs (/opt/rustwide/target/debug/deps/metrics_test-b0d0a8aa148f0f0d)
[INFO] [stdout] 
[INFO] [stdout] running 4 tests
[INFO] [stdout] test test_mfu ... ok
[INFO] [stdout] test test_throughput ... ok
[INFO] [stdout] test test_tracker_accumulation ... ok
[INFO] [stdout] test test_bpb_basic ... ok
[INFO] [stdout] 
[INFO] [stdout] test result: ok. 4 passed; 0 failed; 0 ignored; 0 measured; 0 filtered out; finished in 0.00s
[INFO] [stdout] 
[INFO] [stderr]      Running tests/pretrain_test.rs (/opt/rustwide/target/debug/deps/pretrain_test-67bff438fdf57223)
[INFO] [stdout] 
[INFO] [stdout] running 2 tests
[INFO] [stdout] test test_pretrain_tokens_per_step ... ok
[INFO] [stdout] test test_pretrain_config_defaults ... ok
[INFO] [stdout] 
[INFO] [stdout] test result: ok. 2 passed; 0 failed; 0 ignored; 0 measured; 0 filtered out; finished in 0.00s
[INFO] [stdout] 
[INFO] [stderr]      Running tests/rewards_test.rs (/opt/rustwide/target/debug/deps/rewards_test-1fad8e2466dbdc6a)
[INFO] [stdout] 
[INFO] [stdout] running 20 tests
[INFO] [stdout] test test_extract_final_answer_mc_from_choices ... ok
[INFO] [stdout] test test_accuracy_reward_math_correct ... ok
[INFO] [stdout] test test_accuracy_reward_math_wrong ... ok
[INFO] [stdout] test test_accuracy_reward_mc_correct ... ok
[INFO] [stdout] test test_extract_final_answer_math ... ok
[INFO] [stdout] test test_extract_final_answer_mc ... ok
[INFO] [stdout] test test_format_reward_missing_think ... ok
[INFO] [stdout] test test_composite_reward ... ok
[INFO] [stdout] test test_format_reward_valid ... ok
[INFO] [stdout] test test_length_penalty ... ok
[INFO] [stdout] test test_format_reward_think_after_answer ... ok
[INFO] [stdout] test test_format_reward_malformed_think ... ok
[INFO] [stdout] test test_format_reward_think_but_no_answer ... ok
[INFO] [stdout] test test_strip_think_blocks ... ok
[INFO] [stdout] test test_strip_think_blocks_multiple ... ok
[INFO] [stdout] test test_strip_think_blocks_none ... ok
[INFO] [stdout] test test_tool_use_reward_correct_and_useful ... ok
[INFO] [stdout] test test_tool_use_reward_correct_syntax_but_wrong ... ok
[INFO] [stdout] test test_tool_use_reward_no_tool_not_needed ... ok
[INFO] [stderr]      Running tests/sft_test.rs (/opt/rustwide/target/debug/deps/sft_test-88cca8b73a660836)
[INFO] [stdout] test test_tool_use_reward_no_tool_when_needed ... ok
[INFO] [stdout] 
[INFO] [stdout] test result: ok. 20 passed; 0 failed; 0 ignored; 0 measured; 0 filtered out; finished in 0.00s
[INFO] [stdout] 
[INFO] [stdout] 
[INFO] [stdout] running 3 tests
[INFO] [stdout] test test_sft_config ... ok
[INFO] [stdout] test test_masked_cross_entropy_all_masked ... ok
[INFO] [stdout] test test_masked_cross_entropy_basic ... ok
[INFO] [stdout] 
[INFO] [stdout] test result: ok. 3 passed; 0 failed; 0 ignored; 0 measured; 0 filtered out; finished in 0.01s
[INFO] [stdout] 
[INFO] [stderr]      Running tests/trainer_test.rs (/opt/rustwide/target/debug/deps/trainer_test-d646038b92e47780)
[INFO] [stdout] 
[INFO] [stdout] running 2 tests
[INFO] [stdout] test test_loss_decreases_over_steps has been running for over 60 seconds
[INFO] [stdout] test test_single_train_step has been running for over 60 seconds
[INFO] [stdout] test test_single_train_step ... ok
[INFO] [stdout] test test_loss_decreases_over_steps ... ok
[INFO] [stdout] 
[INFO] [stdout] test result: ok. 2 passed; 0 failed; 0 ignored; 0 measured; 0 filtered out; finished in 240.04s
[INFO] [stdout] 
[INFO] [stderr]      Running tests/value_head_test.rs (/opt/rustwide/target/debug/deps/value_head_test-141b7415c7b8809e)
[INFO] [stdout] 
[INFO] [stdout] running 3 tests
[INFO] [stdout] test test_value_head_forward_shape ... ok
[INFO] [stdout] test test_value_head_output_is_scalar_per_sample ... ok
[INFO] [stdout] test test_value_head_mse_loss ... ok
[INFO] [stdout] 
[INFO] [stdout] test result: ok. 3 passed; 0 failed; 0 ignored; 0 measured; 0 filtered out; finished in 0.01s
[INFO] [stdout] 
[INFO] [stderr]    Doc-tests picochat_core
[INFO] [stdout] 
[INFO] [stdout] running 0 tests
[INFO] [stdout] 
[INFO] [stdout] test result: ok. 0 passed; 0 failed; 0 ignored; 0 measured; 0 filtered out; finished in 0.00s
[INFO] [stdout] 
[INFO] [stderr]    Doc-tests picochat_data
[INFO] [stdout] 
[INFO] [stdout] running 0 tests
[INFO] [stdout] 
[INFO] [stdout] test result: ok. 0 passed; 0 failed; 0 ignored; 0 measured; 0 filtered out; finished in 0.00s
[INFO] [stdout] 
[INFO] [stderr]    Doc-tests picochat_engine
[INFO] [stdout] 
[INFO] [stdout] running 0 tests
[INFO] [stdout] 
[INFO] [stdout] test result: ok. 0 passed; 0 failed; 0 ignored; 0 measured; 0 filtered out; finished in 0.00s
[INFO] [stdout] 
[INFO] [stderr]    Doc-tests picochat_eval
[INFO] [stdout] 
[INFO] [stdout] running 0 tests
[INFO] [stdout] 
[INFO] [stdout] test result: ok. 0 passed; 0 failed; 0 ignored; 0 measured; 0 filtered out; finished in 0.00s
[INFO] [stdout] 
[INFO] [stderr]    Doc-tests picochat_optim
[INFO] [stdout] 
[INFO] [stdout] running 0 tests
[INFO] [stdout] 
[INFO] [stdout] test result: ok. 0 passed; 0 failed; 0 ignored; 0 measured; 0 filtered out; finished in 0.00s
[INFO] [stdout] 
[INFO] [stderr]    Doc-tests picochat_serve
[INFO] [stdout] 
[INFO] [stdout] running 0 tests
[INFO] [stdout] 
[INFO] [stdout] test result: ok. 0 passed; 0 failed; 0 ignored; 0 measured; 0 filtered out; finished in 0.00s
[INFO] [stdout] 
[INFO] [stderr]    Doc-tests picochat_tokenizer
[INFO] [stdout] 
[INFO] [stdout] running 0 tests
[INFO] [stdout] 
[INFO] [stdout] test result: ok. 0 passed; 0 failed; 0 ignored; 0 measured; 0 filtered out; finished in 0.00s
[INFO] [stdout] 
[INFO] [stderr]    Doc-tests picochat_tool
[INFO] [stdout] 
[INFO] [stdout] running 0 tests
[INFO] [stdout] 
[INFO] [stdout] test result: ok. 0 passed; 0 failed; 0 ignored; 0 measured; 0 filtered out; finished in 0.00s
[INFO] [stdout] 
[INFO] [stderr]    Doc-tests picochat_train
[INFO] [stdout] 
[INFO] [stdout] running 0 tests
[INFO] [stdout] 
[INFO] [stdout] test result: ok. 0 passed; 0 failed; 0 ignored; 0 measured; 0 filtered out; finished in 0.00s
[INFO] [stdout] 
[INFO] running `Command { std: "docker" "inspect" "4e72ac9e64b4e8a3cf48c46c5d85e82ac7c3b5820107818548538845b2920f27", kill_on_drop: false }`
[INFO] running `Command { std: "docker" "rm" "-f" "4e72ac9e64b4e8a3cf48c46c5d85e82ac7c3b5820107818548538845b2920f27", kill_on_drop: false }`
[INFO] [stdout] 4e72ac9e64b4e8a3cf48c46c5d85e82ac7c3b5820107818548538845b2920f27
